Multi-pitch trajectory estimation of concurrent speech based on harmonic GMM and nonlinear kalman filtering
نویسندگان
چکیده
This paper describes a multi-pitch tracking algorithm of 1-channel simultaneous multiple speech. The algorithm selectively carries out the two alternative processes at each frame: frame-independent-process and framedependent-process. The former is the one we have previously proposed[6], that gives good estimates of the number of speakers and F0s with a single-frame-processing. The latter corresponds to the topic mainly described in this paper, that recursively tracks F0s using nonlinear Kalman filtering. We tested our algorithm on simultaneous speech signal data and showed higher performance than when the frame-independent-process was only used.
منابع مشابه
Approximate Kalman Filtering for the Harmonic plus Noise Model
We present a probabilistic description of the Harmonic plus Noise Model (HNM) for speech signals. This probabilistic formulation permits Maximum Likelihood (ML) parameter estimation and speech synthesis becomes a straightforward sampling from a distribution. It also permits development of a Kalman filter that tracks model parameters such as pitch, harmonic amplitudes, and autoregressive coeffic...
متن کاملKalman tracking of linear predictor and harmonic noise models for noisy speech enhancement
This paper presents a speech enhancement method based on the tracking and denoising of the formants of a linear prediction (LP) model of the spectral envelope of speech and the parameters of a harmonic noise model (HNM) of its excitation. The main advantages of tracking and denoising the prominent energy contours of speech are the efficient use of the spectral and temporal structures of success...
متن کاملMulti-pitch estimation by a joint 2-d representation of pitch and pitch dynamics
Multi-pitch estimation of co-channel speech is especially challenging when the underlying pitch tracks are close in pitch value (e.g., when pitch tracks cross). Building on our previous work in [1], we demonstrate the utility of a two-dimensional (2-D) analysis method of speech for this problem by exploiting its joint representation of pitch and pitch-derivative information from distinct speake...
متن کاملImproving YANGsaf F0 Estimator with Adaptive Kalman Filter
We present improvements to the refinement stage of YANGsaf[1] (Yet ANother Glottal source analysis framework), a recently published F0 estimation algorithm by Kawahara et al., for noisy/breathy speech signals. The baseline system, based on time-warping and weighted average of multi-band instantaneous frequency estimates, is still sensitive to additive noise when none of the harmonic provide rel...
متن کاملOptimal Estimation of Harmonic Components Using ISFLA
In this paper a novel method based on evolutionary algorithms is presented to estimate the harmonic components. In general, the optimization of the harmonic estimation process is a multi-component problem, in which evaluation of the phase and harmonic frequency is the nonlinear part of the problem and is solved based on the mathematical and evolutionary methods; while estimation of amplitude of...
متن کامل